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Abstract 

Three player quantum Kolkata restaurant problem is modeled using 
three entangled qutrits. This first use of three level quantum states in 
this context is a step towards a A-choice generalization of the A-player 
quantum minority game. It is shown that a better than classical payoff 
is achieved by a Nash equilibrium solution where the space of available 
strategies is spanned by subsets of SU(3) and the players share a tripartite 
entangled initial state. 

Keywords: Quantum information theory, Quantum game theory, Quantum mi- 
nority games, Qutrits, Three level systems, Multipartite entangled states. 



1 Introduction 

Quantum game theory is a fairly recent extension of game theoretical analysis 
to situations formulated in the framework of quantum information theory. The 
first papers appeared in 1999. Meyer showed with a model of a penny-flip game 
that a player making a quantum move always comes out as a winner against a 
player making a classical move regardless of the classical players choice [T] . The 
same year Eisert et al. published a quantum protocol in which they overcame 
the dilemma in Prisoners dilemma 0. In 2003 Benjamin and Hayden general- 
ized Eisert 's protocol to handle multiplayer quantum games and introduced the 
quantum minority game together with a solution for the four player case which 
outperformed the classical randomization strategy [3J. This result was later 
generalized to the n-players by Chen et al. in 2004 [3]. Multiplayer minority 
games has since then been extensively investigated by Flitney et al. [S] El E] - 

We will here extend quantum minority games to situations where there are 
not only multiple players, but also multiple choices. A quantum version of 
the Kolkata restaurant problem, which is a generalized minority game will be 
presented. The players uses maximally entangled qutrits as a quantum resource 
and selects their strategy by locally acting with a general SU(3) operator on the 
qutrit in their possession. 
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1.1 Kolkata restaurant problem 



The Kolkata restaurant problem is a minority-type game [5J 02 EH HH H2] • in 
its most general form TV non-communicating agents (players), have to choose 
between n choices. The agents receive a gain in their utility if their choice is 
not too crowded, i.e the number of agents that made the same choice is under 
some threshold limit. The choices can also have different values of utility as- 
sociated with them, accounting for a preference profile over the set of choices. 
The original formulation comes with a story of workers in Kolkata that during 
lunch hours has to choose between a fixed number of cheap restaurants. Each 
restaurant can only serve a finite number of customers, so workers arriving to a 
crowded restaurant will simply miss the opportunity of having lunch. Often is 
the number of agents taken to be equal to the number of restaurants, and the 
maximum number of costumers per restaurant limited to one. The problem is 
usually modeled as an iterative game where agents ought to base their decision 
on information about the distribution of agents over choices in the previous iter- 
ations. The Kolkata restaurant problem offers therefore a method for modeling 
heard behavior and market dynamics, where visiting a restaurant translates to 
buying a security, in which case an agent wishes to be the only bidder. 



1.2 The model 

In our simplified model there are just three agents, Alice, Bob and Charlie. 
They have three possible choices: security 0, security 1 and security 2. They 
receive a payoff $ of one unit if their choice is unique, i.e that nobody else has 
made the same choice, otherwise they receive $ = 0. The game is so called one 
shoot, which means that it is non-iterative, and the agents have no information 
from previous rounds to base their decisions on. Under the constraint that they 
cannot communicate, there is nothing left to do other than randomizing between 
the choices. Given the symmetric nature of the problem, any deterministic 
strategy would lead all three agents to the same strategy, which in turn would 
mean that all three would leave empty handed. There are 27 different strategy 
profiles possible, i.e combinations of choices. 12 of which gives a payoff of $ = 1 
to each one of them. Randomization gives therefore agent i an expected payoff 
of E c ($) = |, where the superscript denotes that the result is due to the best 
classical strategy (as opposed to quantum strategy). 

In the framework of quantum game theory [T31 HH [HI HB], Alice, Bob and 
Charlie shares a quantum resource. Each has a part of a multipartite quan- 
tum state. They play their strategy by manipulating their own part of the 
combined system, before measuring their subsystems and choosing accordingly. 
Whereas classically the players would be allowed randomizing over a discrete 
set of choices, in the quantum version each subsystem is allowed to be trans- 
formed with the full machinery of quantum operations. A strategy, or choice 
therefore translates to choosing a unitary operator U. In the absence of entan- 
glement, quantum games of this type usually yield the same payoffs as their 
classical counterparts, whereas the combination of unitary operators (or a sub- 
set therein) and entanglement, sometimes strongly outperforms classical games 
and decision theoretic models. We will here present such a case. 
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2 Qutrits and parametrization of SU(3) 



A qutrit is a 3-level quantum system on 3-dimcnsional Hilbcrt space % = C 3 , 
written in the computational basis as: 



|V;> =ao|0) + ai|l) + aa|2) e C 3 , (2.0.1) 

with ao> oil 0,2 £ C and |ao| 2 + |ai| 2 + |a2| 2 = 1. A general ./V-qutrit system \^f) is 
a vector on 3 A '-dimcnsional Hilbcrt space, and is written as a linear combination 
of 3 N orthonormal basis vectors. 
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where 



TV-times 
, * , 

\x N ■■■ xx) = \x N ) ® |a;jv-i) ® ■ ■ ■ ® |sgi) e "H = C 3 ® ... ® C 3 , (2.0.3) 

with € {0, 1,2} and complex coefficients a Xi , obeying ^ \a XN ... Xl \ 2 = 1. 

Single qutrits are transformed with unitary operators U £ SU(3), i.e opera- 
tors from the special unitary group of dimension 3, acting on % as U : % — > %■ 
In a multi-qutrit system, operations on single qutrits are said to be local. They 
affect the state-space of the corresponding qutrit only. The transformation of a 
multi-qutrit state vector under local operations is given by the tensor products 
of the individual operators: 



\Vfin) = U N ® U N -i ® •■■ <g> Ui |* m ) , (2.0.4) 

where |^j„) and |\&/j n ) denotes the initial and final state of the system respec- 
tively. 

There are a number of ways you can parameterize SU(3) Q~5J H5] . One 
common approach is through the Lie algebra of the group, the eight traceless 
3x3 Gcll-Mann matrices. We are using a different and maybe slightly more 
intuitive parametrization |17j . Let x, y, z be three general, mutually orthogonal 
complex unit vectors, such that x ■ y = and x* x y = z. We construct a SU(3) 
matrix by placing x, y* and z as its columns. Now a general complex unit vector 
is given by: 

sin#cos0e IQ; 

sin(9sin<j()e M2 | , (2.0.5) 
cos6»e M3 

and one complex unit vector orthogonal to x is given by: 

(cosxcos#cos(/)e l( -' 3l ~ Ql ' 1 + sin x sin <f)e % ^~ ai > 
cos x cos 6< sin V)e i(/3l_Q2) - sinxcos ^e i(/32 - Q2) |, (2.0.6) 
-cosxsin6le i(/3l - Q3) 

where < (f>,9,x,< n/2 and < ai, a.2, as, /3i, $2 < 27r. We have a general 
SU(3) matrix U, given by: 
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/ xi y\ x* 2 y 3 - y\x 2 
U = I x 2 y* 2 x\y x - y\x-i 



) 



(2.0.7) 



and it is controlled by eight real parameters 0, 9, x, oti,a.2, as, 01,02. 

3 The scheme 

The scheme under study is a development of one first introduced by Eisert et 
al. [2], and later generalized by Benjamin and Hayden [3J. It starts out with 
Alice, Bob and Charlie, A, B and C respectively, sharing a quantum resource, an 
entangled tripartite 3-level quantum state. We need to allow the quantum states 
to have a common origin, since creating entanglement is a global operation, and 
cant be done by acting locally on the subsystems. We assume that there exists an 
unbiased referee that prepares the state and distributes the subsystems among 
the players. From that point on, no communication is allowed between the 
players and the referee. Each qutrit is due to be measured by the player owning 
it, at the end of the protocol in the {|0), |1), |2)} -basis, where basis vector 
corresponds to one of the three choices: security 0, security 1, and security 
2. The players plays their strategy by applying an operator from the set of 
allowed strategies S, followed by a local measurement which determines their 
final choice. The unitary operations done by A, B, C are done locally, which 
means that the operator is applied on the subsystem held by the player. As 
mentioned, this translates to the transformation of |^>j n ) by the tensor product 
of the unitary operators applied by the players. 

We want to create a quantization of the classical game in which we expand 
the set of available strategies to include quantum moves. While we are propos- 
ing a quantum game which in some sense is fundamentally different from the 
classical version, we require it to be an extension, not an addition to the clas- 
sical Kolkata restaurant problem. Tracing the steps of the predecessors of this 
protocol, we restrict our formulation to have the classical game fully present at 
all times, accessible in the form of restrictions on the set of allowed local opera- 
tions. Wc simply require that there exists a set of operators that when applied 
locally on an entangled initial state gives the same outcomes as in the classical 
non-quantum version. Lets first look at the classical game presented with quan- 
tum formalism. Note that there is nothing quantum mechanical happening at 
this point. The initial state \ipi n ) = 1 000) = \0}c <g> \0)b ® \0)a corresponds to 
the case where the three players chooses security 0, by default. The individual 
choices are made by applying operators Si, Sj, st£S = {sq, Si, S2} to each sub- 
system. The exact form of these operators can be left to discuss later. The only 
restriction at the moment is that they obey: so|0) = |0), si\0) = |1), S2 10) = |2), 
resulting in fully deterministic outcomes: 



As mentioned earlier there are 27 different such outcomes, each linked to dif- 
ferent combinations of the operators {sq, si, S2}, 12 of which gives a player a 
payoff $ = 1, and the rest $ = 0. Clearly there are no operators available cor- 
responding to mixed strategies, so randomization processes leading to classical 



Si <g> Sj ® s fc |000) = \ijk). 



(3.0.8) 
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mixed strategies are here lifted outside the protocol and is done by the players 
before the choices are made. Having finished the first step in the quantization 
process our task is now to keep the classical game present throughout the com- 
ing steps while we add quantum structure by choosing an entangled initial state 
and expanding the set of strategy operators to include any U € S = SU(3). 
Since the game is symmetric and unbiased in regards to permutation of player 
positions, then this is a property that has to be true of the initial state \ipi n ), to 
assure that the payoff functions $i(\tpi n ), Ua, Ub, Uc) of all three players i are 
identical up to some permutation of Ua, Ub, Uc- Note that when dealing with 
mixed classical and quantum strategies the payoff function becomes an expec- 
tation value E($) of a probability distribution over the different outcomes. We 
summarize the criteria for choosing an initial state: 

1. The state ought to be entangled, to accommodate for correlated random- 
ization among the players. 

2. It should be symmetric and unbiased in regards to player positions. 

3. It must allow for classical game to be accessed by restrictions on the space 
of available strategy operators. 

The three qutrit GHZ-typc-statc: 



not only fulfills the above criteria, it is also a maximally entangled state on 
H = C 3 <g> C 3 <£) C 3 . It has the additional property of initializing the game in 
an maximally undesired state, i.e. one in which none of the players receives 
any payoff. In order to change their situation, they will have to make an active 
choice. It is left to show that we can define a set of operators corresponding 
to classical pure strategies that gives raise to deterministic classical payoffs 
when applied to the entangled initial state. This problem was addressed by 
Eisert et. al. [2], and further developed by Benjamin et.al. [3] for cases of n 
players and two choices, by defining an entangling operator J and its inverse Jt, 
acting on a n-qubit product state 1 00 • • • 0) with Hcrmitian strategy operators 
so, si, sandwiched in between. By showing that any combination of the classical 
strategies s Xl ® s X2 ® • • • ® s Xn , Xi <G {0, 1} commutes with J, one guarantees 
that the classical game is embedded in the quantum version. That route is 
not possible when formulating a game with aid of higher dimensional quantum 
states like qutrits, since at least two different Lie-algebra elements of su(3) must 
appear in the Hamiltonian of J (For the GHZ-type-state), whereby commutation 
is no longer a fact in the general case. We need a set of operators that replicates 
the classical payoffs when applied directly on our entangled initial state | tpi n }- 
The cyclic group of order three, C3, generated by the matrix: 



where s 3 = s° = I and s 2 = s T , has the properties we are after. The set of 
classical strategies S = {s°,s l ,s*} with s l <£> s J ® s fc |000) = \ijk) acts on the 
GHZ-statc as: 



I Vin) = - 7 =(|000) + |111) + |222)) ! 



(3.0.9) 




(3.0.10) 
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s l ® s 3 ® s fc -^= (| 000) + |111) + |222>) 



v/3 

= (|0 + % + j + k) + |1 + il + j 1 + fe) + |2 + « 2 + j 2 + fc)) . (3.0.11) 
v3 

Note that the superscripts denotes powers of the generator and that the addition 
is modulo 3. In the case under study, where there is no preference profile over 
the different choices, any combination of the operators in S = {s°, s 1 , s 2 } leads 
to the same payoffs when applied to the GHZ-state as to |000). 

Now that an entangled initial state | ipm) is chosen, the scheme for the 
quantum game proceeds as follows. We form a density matrix pi n out of the 
initial state | ipi n ) and add noise that can be controlled by the parameter / [7]. 
We get: 

Pin=f\i>in){ll>in \+^^h 7 , (3.0.12) 

where I27 is the 27 x 27 identity matrix. Alice, Bob and Charlie now applies 
a unitary operator U that maximizes their chances of receiving a payoff $ = 1, 
and thereby the initial state pi„ is transformed into the final state pfi n - 

Pfin = U f ®U f ®U f Pm U ®U ®U. (3.0.13) 

Note that they are all applying the same operator U since in the absence of 
communication, coordination of which operator to be applied by whom, would 
be impossible. We define for each player i a payoff-operator Pi , which contains 
the sum of orthogonal projectors associated with the states for which player i 
receives a payoff $ = 1 . For Alice this would correspond to 

PA = I ^2 \X 3 X2X 1 )(X 3 X 2 X 1 \ 7 X 3 ^ X 2 ,X 3 ^ X l7 X 2 ^ X!] + 

+ \ x 3X2Xl)(x 3 X 2 X 1 \, X 3 = X 2 ^ Xl J . (3.0.14) 

\x 3 , x 2 ,x 1 =0 / 

The expected payoff Ei($) of player i is calculated by taking the trace of the 
product of the final state pfi n and the payoff-operator Pj: 

£?<($) =Tr(Pip fin ). (3.0.15) 



4 Optimal strategy 

The problem now is to find the unitary operator [/(</>, 0, x, &i, oli, a 3 , Pi, /3 2 ) 
that maximizes the expected payoff. Due to the symmetry of the problem, 
optimization can be done with respect to the Pi of any of the three players. 
Doing so one arrives at a maximum expected payoff of E(§) = |, assuming 
(/ = 1), compared to the classical E c (%) = |. Which is an 50% increase. 
This occurs when the players applies the optimal unitary operator U opt , whose 
parameters are listed in table 1. 



G 



Parameter 


<f> 


6 


X 


a.\ 


«2 


a 3 




/?2 


Value 


IT 

4 


COS - 1 (^) 


4 


5tt 

18 


18 


18 


7T 

3 


117T 

6 



Table 1: U opt in the given parametrization. 



Because of the periodic nature of the solution, there could be more than one 
unique choice for some of the parameters within the allowed ranges < cj>,0,x < 
it/2 and <, oti, 0i, 02 < 27r. This is the case for ai, 02,03, where 

maximum expected payoff is achieved for ( 5+ 1 1 8 2 " ) n £ {0, 1, 2}. Noting that 
the the center of SU(3), Z(3) ={I, e ±! ^ L 1} only adds a global phase and leaves 
the density matrix invariant, one concludes that the transformation belongs to 
SU(3)/Z(3). This removes the above ambiguity, ending up with ai, 03 = y|. 

The final state arrived at by playing U opt is given by: 

I V>/m) = I (1000) + |012) + |021) + [102) + 

|111) + 1 120) + |201) + |210) + |222)) . (4.0.16) 

This is an even distribution of all the states that leads to payoff to all three 
players and the states which gives payoff to none and shows that the U opt <g> 
[/ opt Cg>Zy op *-operation fails to make the state fully depart from the space spanned 
by 1 000), I 111), 1 222). This failure accounts for the expected payoff not reaching 
unity. 

Now by setting a±,ct2 =0 and 03 = a, in the parametrization, one arrives 
at a six parameter subset of SU(3), given by operators U (4>, 9, Xi ck, /5i , /?2)- The 
optimum is at the same value as with the transformation belonging to its do- 
main. There is thereby a V opt in this subset, given in table 2 below, that gives 
each player an expected payoff of E($) = | = |. 



Parameter 


<t> 


6 


X 


a 


Pi 


02 


Value 


7T 

4 




TV 

4 


IT 

2 


7T 

3 


5-7T 

6 



Table 2: V opt in the reduced parametrization. 



U opt and V opt differs only by a constant phase factor, so for our purposes, 
what's true of one is true of the other. We will therefore regard the reduced 
parametrization when showing that the solution is a Nash equilibrium in the 
next section. 

If we further reduce the parametrization by letting all phase parameters 
(Xi, as, as, 0i, 02 = 0, we end up with an operator O{4>,0,x) S SO(3), i.e. the 
elements of the special orthogonal group of dimension 3. These operators cor- 
responds to rotations in R 3 . In quantum games with two choices, like quantum 
prisoners dilemma and in minority games, local orthogonal operations merely 
achieves to replicate the results of classical mixed strategies and offers no im- 
provement in the expected payoff, even with a maximally entangled initial state. 
In this case though, there exists an O opt £ SO (3), given in table 3 that outper- 
forms the classical expected payoff by a small margin. Each player would in this 
case receive a payoff of E(%) — |y, compared to the classical E($) = | = ff • 
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This result might open up the possibility of a new classification of quantum 
games, where there could exist a category of quantum games with classical 
strategies that are fundamentally different than classical games with classical 
strategies. 



Parameter 





9 


X 


Value 


7T 


cos- 1 (i) 


7T 

6 



Table 3: O opt in the given parametrization. 



4.1 Nash equilibrium 

To show that this solution is valid from a game-theoretical point of view, we need 
to show that V op t is a Nash equilibrium, i.e. that none of the players gains by uni- 
laterally changing strategy from V opt to any other strategy U (</>, 6, x, a, /Si,^)- 
Without loss of generality, we show for the expected payoff Ea($) of Alice that 
the following inequality holds for any V: 



E A {$){V° pt 



yopt 



V7') > E A ($)(V° pt g> V° pt g> V A ). 



(4.1.1) 



We show that this is the case by letting Alice act with a general unitary operator 
V((j), 6, x, a, /3i,/3 2 ) € SU(3), while Bob and Charlie acts with y op * {U opt ). Then 
we take the partial derivatives of Ea{%) with respect of each of the parameters 
while keeping the rest at optimal value. Vanishing partial derivatives together 
with a negative definite Hessian matrix at the values of V opt proves that V opt is 
Alice's dominant strategy and because of the symmetry of the protocol, thereby 
a Nash equilibrium. 



0E A ($) 



<t>=<P' 



9 



cos(20) 



0, 



d 2 E A ($) 



< 0, (4.1.2) 



dE A ($) 



00 



— (-V3sm(e) + 3\/2cos(2<9) 



+ 



3sin(6») + V&) cos(0) 



d 2 E A ($) 



d9 2 



<0, (4.1.3) 



dE A ($) 



x=x 



2 

27 



(cos(x) - sin(x)) (sin(x) + cos(x) + V2 

d 2 E A ($) 
d\ 2 



x=x 
< 0, (4.1.4) 



dE A ($) 
da 



4 cos(a) 
27 



0, 



d 2 E A (%) 
da 2 



< 0, (4.1.5) 
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dE A ($) 



54 



(-3sin08i)+sin(2j8i)- 



+3V3cos(/3i) + v^cos^i) 



= 0, 



d 2 E A {%) 
dpi 



<0, (4.1.6) 



dE, 



P2=P' 2 



54 



((2sin(/3 2 ) + 3)(-cos(/3 2 ))- 



-V3(3sin(/3 2 )+cos(2/3 2 )) 



0, 



d 2 E A ($) 
d/3 2 2 



<0. (4.1.7) 



By calculating the Hessian H with 

d 2 



Hi. 



dciida. 



E A (%) 



(4.1. 



where ai,a,j <G {(f), 9, x, a, /3 2 }, and confirming that all eigenvalues are nega- 
tive, we conclude that T/ op * {U opt ) is indeed a Nash equilibrium. 



4.2 Adjusting entanglement and fidelity 

We have included a simple model of noise, to show the behavior of the expected 
payoff, when the initial state was adjusted towards a completely mixed state. 
This was done by controlling the fidelity / of the initial state, by mixing it with 
an even distribution of all basis states in T~L = C 3 <E) C 3 <£) C 3 . Clearly as / — >• 0, 
we should expect the entanglement as a resource in the initial state to vanish. 
This is of course the case and we have E($)(U opt J) = (2(2 + /))/9. For / = 
we simply end up with the classical result. 

A way of directly adjusting the strength of entanglement in the initial state, 
while keeping the state pure is to start with 

| Vin) = sin cos 1 000) + sintf sin^|lll) +costf|222), (4-2.1) 

where < ■& < tt and < ip < 2tt. We retrieve the maximally entangled state 
(|3H9| for ip = f , 2zt and d = ± cos _1 (l/V3). The expected payoff is given by: 

E($)(U opt , #,<p) = - (sin(y») sin(2^) + cos(^) (2 sin(^) sin 2 (i?) + sin(2i?)) + 4) , 

(4.2.2) 

which shows that any deviation from maximal entanglement reduces the ex- 
pected payoff towards the classical E c (%), graphically shown in figure 1. 

A point to note here is that the maximum expected payoff strongly depends 
on the choice of initial state \ipi n ), and that there can exist more or less suitable 
initial states depending on the task. We chose the GHZ-state for this protocol 
because it is an unbiased maximally entangled state, which lets the classical 
game be present and accessible trough restrictions on S. Would our preferences 
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Figure 1: Expected payoff E($)(U opt , as a function of and y at the Nash 
equilibrium strategy. 

been different and we had chosen for example the antisymmetric Aharonov state 
instead: 



where e XsX2X1 is the completely antisymmetric tensor, then the expected payoff 
would have been E($) = 1, just by letting the players apply the identity oper- 
ator. This state would guarantee that everybody ends up with a unique choice 
every time. But that wouldn't be of any interest from a game theoretical per- 
spective since outcomes would have resembled a classical game with unrestricted 
communication. However, due to the the invariance of \A-) under local unitary 
transformations of the form U ®V ®U , superpositions of \A-) and |000) under 
some restricted set of operators resembling the set of mixed classical strategies, 
could model a classical game under different amounts of communication. 

5 Conclusions 

We have created the first quantum model for a three player, three restaurant 
Kolkata restaurant problem. We have shown that when the players share an 
initial tripartite entangled state, there exists a local unitary operation for which 
the players can increase their expected payoff E($) by 50% compared with 
classical randomization. This solution is a Nash equilibrium and therefore a 
natural attractor in the space of available strategies. The achievement of this 
performance is highly dependent on the strength of entanglement and the fidelity 
of the initial state. 

Acknowledgments: We wish to thank Ole Andersson for valuable inputs 
and fruitful discussions. This study was supported by the Swedish Research 
Council (VR). 




(4.2.3) 
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